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Abstract 

In this paper we present the open-source WikiPathways app for Cytoscape ( 
http://apps.cytoscape.org/apps/wikipathways) that can be used to import 
biological pathways for data visualization and network analysis. WikiPathways 
is an open, collaborative biological pathway database that provides fully 
annotated pathway diagrams for manual download or through web services. 
The WikiPathways app allows users to load pathways in two different views: as 
an annotated pathway ideal for data visualization and as a simple network to 
perform computational analysis. An example pathway and dataset are used to 
demonstrate the functionality of the WikiPathways app and how they can be 
combined and used together with other apps. More than 3000 downloads in the 
first 12 months following its release in August 2013 highlight the importance 
and adoption of the app in the network biology field. 
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i Amendments from Version 1 

Based on the comments of reviewer 1 , we adapted the figures 
and fixed a typo in the pathway. Furthermore, we extended the 
conclusion based on the comments of reviewer 2. 

Additionally we included the reference to the BridgeDb App paper 
that was recently published. Therefore some of the reference 
numbers have changed. 

See referee reports 



Introduction 

Pathways are commonly used as models for understanding bio- 
logical processes. WikiPathways 1 is an open, collaborative, wiki- 
based website for the curation of biological pathways that are more 
than just images. WikiPathways provides easy-to-use drawing and 
annotation tools to capture identities, relationships, comments and 
literature references for each pathway element and interaction. 
Contributed pathways are displayed like articles at WikiPathways 
and can be downloaded manually or programmatically through web 
services. This opens the possibility for pathway information to be 
accessed by other software tools for data visualization, computa- 
tional analysis and the interpretation of large-scale experimental 
data. 

Utilizing WikiPathways web services, we developed an app for 
Cytoscape 2 , a network visualization and analysis software plat- 
form. The app queries and imports pathways from WikiPathways 
within the Cytoscape environment. Cytoscape's core concepts 
are networks (nodes and edges), tables (rows and columns) and 
styles, which map table values to the visual properties of networks. 
Cytoscape leverages a rich ecosystem of apps to provide additional 
domain- specific semantics and data types, as well as custom visu- 
alization and analysis capabilities. With the WikiPathways app, we 
implemented two ways to represent a pathway as a Cytoscape net- 
work. In the first way, pathways are loaded with the complete visual 
appearance of the original at WikiPathways, including graphical 
annotations and labels. Once in Cytoscape, experimental data can 
be loaded as tables and visually mapped onto these pathway- style 
networks to provide biological context. In the second way, path- 
ways are loaded as simplified networks, focusing on the biological 
entities and their interactions without any of the graphical elements 
of the original pathway diagram. The basic network style is ideal 
for topological analyses, network merging and automatic layout. 

In this paper we present the implementation and usage of the WikiP- 
athways app for Cytoscape. By bringing pathways into Cytoscape 
using the WikiPathways app, it is possible to make full use of 
pathway models with custom visualizations and computational 
analyses. 

Implementation 

The WikiPathways app was developed for Cytoscape 3, which 
introduced a completely new software architecture. The new archi- 
tecture is built on top of Open Service Gateway Initiative (OSGi) 3 , a 
software framework of pluggable modules and services. To be able 



to take advantage of the new architecture (Cytoscape API version 
3.0.0), the predecessor to the WikiPathways app, the GPML Plugin, 
had to be rewritten. 

Pathway import 

The WikiPathways app employs the new architecture of Cytoscape 
in two ways. First, the app exports a user interface that can query 
and import pathways from the WikiPathways web service. Thanks 
to the service architecture in Cytoscape, this interface is seamlessly 
incorporated into Cytoscape's "Import from Public Databases" 
dialog. Second, the app provides an API for programmatic access to 
the WikiPathways web services and the GPML file importer. Other 
apps can use the API to make queries to the WikiPathways web ser- 
vices and import GPML files without having to bundle the WikiP- 
athways app. When the WikiPathways app is loaded in Cytoscape, 
the app registers the implementation of its API with the OSGi mod- 
ule system. Other apps can then request the API implementation 
through OSGi. 

Visualization 

The new architecture also posed new challenges that required us to 
innovate with respect to visual styles. The new architecture includes 
a revamped model to represent networks. This model decouples the 
network topology and table data from its visual style. Visual styles 
constitute Cytoscape's view model. When a node or edge is cre- 
ated in the network model, its view object is only created after a 
triggering of an event. Cytoscape does this to avoid redrawing of 
the network canvas while an app is still in process of building the 
network. Indeed, as the WikiPathways app reads a GPML file, it 
creates a series of nodes and edges in a network to represent the 
pathway. During this process, the app needs to assign visual styles 
to the nodes and edges it creates. However, as new nodes and edges 
are being added to the network, their view objects do not exist yet, 
making it impossible to assign their visual styles. To address this 
issue, we created a class called DelayedVizProp that stores our 
desired visual styles for nodes and edges. Once the network has 
been fully built, the app tells Cytoscape to create the view objects 
for the new nodes and edges. After that, the app looks through the 
DelayedVizProp instances and assigns nodes and edges their 
desired visual style. 

Dependencies 

The app relies on the PathVisio core library 4 to read GPML files. 
The PathVisio library is included in the app. In previous versions 
of Cytoscape, apps that included libraries often conflicted with 
each other. Users had to painstakingly uninstall conflicting apps for 
Cytoscape to become usable again. OSGi solves this problem by 
insulating Cytoscape modules and apps from each other. Due to 
OSGi's architecture in Cytoscape 3, the integrated PathVisio library 
is hidden from other apps and modules in Cytoscape and cannot 
conflict with them. 

The app also uses the Apache HTTP Client library to make HTTP 
requests to the WikiPathways REST server. We avoided the Java 
built-in HTTP client class (java.net.HttpURLConnection), 
which is used frequently in Cytoscape and other apps. This class 
does not support cancellation. Proper cancellation is important for 
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a responsive user interface. Users behind an interrupted internet 
connection should be able to back out of a WikiPathways request 
and return to Cytoscape. Each HTTP request is wrapped in a task, 
a unit of work in Cytoscape. When the user clicks cancel during the 
task execution, the app terminates the underlying HTTP request by 
calling the abort method in the Apache HTTP Client library. 

Results 

The WikiPathways app in Cytoscape provides convenient access to 
the community-curated collection of biological pathways at WikiP- 
athways. The functionality of the app is demonstrated here using 
the human Cardiac Hypertrophic Response pathway from WikiP- 
athways (http://wikipathways.org/instanceAVP2795) combined with 
an unpublished RNA-seq dataset that reflects gene expression lev- 
els during differentiation of cardiac stem cells (see Dataset 1). The 
logFC from timepoint 6 hrs vs control is visualized on the pathway. 
The human Cardiac Hypertrophic Response pathway contains gene 
products and metabolites involved in the intracellular signal-trans- 
duction pathways that coordinate Cardiac Hypertrophic Response. 
As described above, the WikiPathways app allows users to load 
pathways in two different views, as an annotated pathway and as 
a simple network (see Figure 1 and Figure 2). The example dataset 
and pathway will be used to explain how both views can be used 
in Cytoscape. 



When loaded as a pathway, the precise layout of elements is identi- 
cal to its representation at WikiPathways. The graphical elements, 
like labels and shapes, are included in the model in Cytoscape. As 
a pathway diagram, the full representation of biological informa- 
tion is visually preserved, which is ideal for providing a mean- 
ingful context for data visualization. Figure 1 shows the Cardiac 
Hypertrophic Response pathway loaded as an annotated pathway 
in Cytoscape. The Entrez Gene identifiers in the pathway were 
mapped to Ensembl using another app called BridgeDb 5 (http:// 
apps.cytoscape.org/apps/bridgedb) to match the identifiers used 
in the example dataset. The cardiac stem cell tissue development 
expression data can then be loaded, integrated and visualized on 
the pathway nodes (c.f. Introduction to Cytoscape tutorial, http:// 
opentutorials.cgl.ucsf.edu/index.php/Tutorial:Introduction\_to\_ 
Cytoscape\_3 . 1 -part2). 

When loaded as a network, all graphical annotations are removed 
and redundant nodes in the pathway are merged into one unique 
node in the network. Groups and complex interactions are visual- 
ized as very small nodes and a forced directed layout is applied. As 
an abstracted network graph, the same molecular relationships in 
the pathways can be made available for network analysis and aug- 
mentation. Figure 2A shows the Cardiac Hypertrophic Response 
pathway loaded as a network in Cytoscape. This simple network 
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Figure I.The Cardiac Hypertrophic Response pathway loaded as a pathway. LogFC values are visualized as node fill color with a color 
gradient from blue over white to red. Significant measurements (adjusted p-value < 0.05) are highlighted with a green border color. Elements 
in the pathway without a measurement are colored grey. 



Page 4 of 1 5 



FIOOOResearch 2014, 3:152 Last updated: 19 SEP 2014 



Ck -r 






FillColor: logFC 



Figure 2. The Cardiac Hypertrophic Response pathway loaded as a network. (A) The simple network does not contain graphical 
annotations of the pathway. (B) NetworkAnalyzer was used to visualize node degree and betweenness of the nodes in the network to identify 
important hub nodes. (C) The logFC of the example dataset is visualized as node fill color with a gradient (blue over white to red) and the 
adjusted p-value < 0.05 is highlighted with a green border color. (D) jActiveModules finds active subnetworks (highlighted in purple) that are 
affected by varying gene expression. 



structure enables researchers to use other Cytoscape features and 
apps to merge two pathways, apply different layouts to the network 
or extend the pathway, for example, with regulatory interactions 
(CyTargetLinker 6 , http://apps.cytoscape.org/apps/cytargetlinker). It 
also enables users to investigate the topology of the network, like 
calculating degree and betweenness of the nodes with Cytoscape' s 
built-in NetworkAnalyzer tool to identify important hub nodes, see 
Figure 2B. Cytoscape also allows the visualization of experimental 



data in the network, as in Figure 2C which shows the cardiac stem 
cell tissue development expression data. There are several apps 
available for Cytoscape that provide methods that use experimen- 
tal data to cluster nodes in the network (clusterMaker2, http:// 
apps.cytoscape.org/apps/clustermaker2) or find subregions in the 
network affected by varying gene expression (jActiveModules, 
http://apps.cytoscape.org/apps/jactivemodules) as highlighted in 
Figure 2D. 
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Dataset 1 . Dataset studying differentiation of cardiac stem cells 
http://dx.doi.Org/1 0.5256/f 1 000research.4254.d2841 5 

This is a subset of an unpublished RNA-seq dataset containing 
measurements for all genes in the selected Cardiac Hypertrophy 
Response pathway comparing "time point 6 hrs vs. control". 
The dataset contains logFC, p-value and adjusted p-value 
measurements for every gene in the pathway. 



Conclusions 

In this paper we presented the WikiPathways app for Cytoscape, 
which imports biological pathways as curated diagrams or as basic 
node-and-edge networks into Cytoscape. The process of transform- 
ing an arbitrary XML format like GPML into even a basic import 
format for Cytoscape is impractical without this dedicated app. The 
WikiPathways app thus provides researchers with a new, conveni- 
ent method for accessing pathway information. Furthermore, as 
shown in the examples above, the app makes full use of the pathway 
models, allowing researchers to perform computational analyses 
and custom visualizations in conjunction with experimental data 
and network topology. 

Software availability 

Software available from the Cytoscape App Store: http://apps. 
cy to scape . org/app s/ wikip ath way s 

Latest source code: https://github.com/wikipathways/cytoscape-wikip- 
athways-app 

Source code as at the time of publication: https://github.com/F1000Re- 
search/cytoscape-wikipathways-app 

Archived source code as at the time of publication: http://dx.doi. 
org/10.528 l/zenodo.10204 8 



License: Apache License, Version 2.0: http://www.apache.org/ 
licenses/LICENSE-2.0.html 

Data availability 

FIOOOResearch: Dataset 1. Dataset studying differentiation of car- 
diac stem cells, 10.5256/fl000research.4254.d28415 7 
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Supplementary materials 
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Supplementary Figure 1. Human Cardiac Hypertrophic Response pathway from WikiPathways used in the manuscript: http:// 
wikipathways.org/instance/WP2795_r76316. 
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Supplementary Figure 4. Figure 2C in the Wiki Pathways App paper including all graphical details, e.g. Labels. 
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Supplementary Figure 5. Figure 2D in the Wiki Pathways App paper including all graphical details, e.g. Labels. 
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Referee Responses for Version 1 

Ted Goldstein 

Department of Biomolecular Engineering, University of California Santa Cruz, Santa Cruz, CA, USA 

Approved: 04 August 2014 

Referee Report: 04 August 2014 
doi:1 0.5256/f 1 000research.4555.r5523 

You don't really make the point in the conclusions of the key findings of this excellent work which are: 

1 . It is now possible to do something quickly that before was tedious and therefore would not be done 
by anyone, except the most determined and experienced Cytoscape user. 

2. It is now possible to do something that could not be done before because you needed to add new 
functionality. 

I think it would be clearer that these are the main results if you add the following to the results section: 

I suggest that you make a before and after video. The before video is the experience of using Cytoscape, 
manually downloading the data from Wikipathways, failing to see the the nodes because it does not have 
the DelayedVizProp class etc. 

Then the after video shows the full experience. 

The elapsed time of the before video is something like 1 0 minute while the after video is something like 1 
or 2 minutes. This will clearly demonstrate the improvement in the user experience. Please report these 
time numbers in the text in the results section and possibly summarize the numbers in the abstract. 

Of course, also post these videos to both FIOOOResearch and Youtube as they represent the primary 
data of your study. 

I have read this submission. I believe that I have an appropriate level of expertise to confirm that 
it is of an acceptable scientific standard. 

Competing Interests: No competing interests were disclosed. 
1 Comment 

Author Response 

Alexander Pico, The Gladstone Institutes, UCSF, USA 
Posted: 07 Aug 2014 
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Dear Dr. Goldstein, 

Thank you for the enthusiastic review and suggestions. We have drafted a new conclusions 
paragraph to include in the next revision to the article, which should be posted very soon. 
Coincidentally, we were gearing up to prepare a series of introductory videos for WikiPathways, 
each under 10 minutes long. We will certainly have one covering the use of the WikiPathways app 
for Cytoscape and will link/post it here as well. Given limited time and resources, we might not get 
around to producing the "before" video, but we will describe the process for contrast. In fact, the 
"before" approach is not so clear. There are many routes to transforming an arbitrary XML like 
GPML into a input file for Cytoscape, ranging from the most basic SIF to the full xGMML formats. 
And for each format, one must make judgement calls and compromises, and ideally rely on 
someone with detailed knowledge of the source format. So, that approach could take anywhere 
from a day to a week of scripting depending on the desired accuracy and purpose. And, of course, 
it takes months of planning and programming (like we did) to get the general-purpose app result 
we arrived at. 

Thanks again for highlighting these points and for your suggestions! 
Competing Interests: No competing interests were disclosed. 




Christian A Grove 

Division of Biology and Biological Engineering, California Institute of Technology, Pasadena, CA, USA 



Approved: 02 July 2014 

Referee Report: 02 July 2014 

doi:1 0.5256/f 1 000research.4555.r5286 

This paper is well written, clear and to-the-point. The title is appropriate and the abstract provides 
sufficient clarity and detail. The descriptions of the development, usage, and purpose of the 
WikiPathways App for Cytoscape are sufficiently detailed and informative for a bioinformatician such as 
myself. I cannot speak to the sufficiency of the detail of code and programming descriptions, although it 
seems reasonably complete. I've tested the App as the authors have described and found it to be easy to 
install and use with the set of descriptions and instructions provided on the source website and in this 
paper. All datasets and source code are easily obtainable with the links provided by the authors. 

I have found one small typo in the network/pathway which I have manually changed at the 
WikiPathways.org site: the original network referred to the node of a gene product as "PCL", which 
according to the cited reference and the position in the network is surely meant to represent 
"Phospholipase C" which has the canonical abbreviation of "PLC", in the citation for the network and in 
numerous previous studies. Unless I have made this observation in error, I would recommend changing 
this in the network and associated figures. 



I have read this submission. I believe that I have an appropriate level of expertise to confirm that 
it is of an acceptable scientific standard. 



Page 14 of 15 



FlOOOResearch 



FIOOOResearch 2014, 3:152 Last updated: 19 SEP 2014 



Competing Interests: No competing interests were disclosed. 
1 Comment 

Martina Summer-Kutmon, 

Posted: 02 Jul 2014 

Dear Dr. Grove, 

Thank you for your very fast review. The typo you found is indeed a typo. We will update all figures 
accordingly. Thank you for immediately fixing it on WikiPathways yourself 
(http://wikipathways.org/instance/WP2795). This nicely highlights the efficiency of a wiki-based 
pathway archive. 

Competing Interests: No competing interests were disclosed. 
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